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We extend the well known characterization of \/P ws as the class of polyno- 
mials computed by polynomial size arithmetic branching programs to other 
complexity classes. In order to do so we add additional memory to the com- 
putation of branching programs to make them more expressive. We show 
that allowing different types of memory in branching programs increases the 
computational power even for constant width programs. In particular, this 
leads to very natural and robust characterizations of VP and VNP by branch- 
ing programs with memory. 

1. Introduction 

Arithmetic Branching Programs (ABPs) are a well studied model of computation in al- 
gebraic complexity: They were already used by Valiant in the VNP-completeness proof 
of the permanent [13] and have since then contributed to the understanding of arith- 
metic circuit complexity (see e.g. |10t [7]). The computational power of ABPs is well 
understood: They are equivalent to both skew and weakly skew arithmetic circuits and 
thus capture the determinant, matrix power and other natural problems from linear al- 
gebra [8j. The complexity of bounded width ABPs is also well understood: In a parallel 
to Barrington's Theorem [T], Ben-Or and Cleve [2] proved that polynomial size ABPs of 
bounded width are equivalent to arithmetic formulas. 

We modify ABPs by giving them memory during their computations and ask how this 
changes their computational power. There are several different motivations for doing 
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this: We define branching programs with stacks, that are an adaption of the nonde- 
terministic auxiliary pushdown automaton (NAuxPDA) model to the arithmetic circuit 
model. The NAuxPDA-characterization of LOGCFL has been very successful in the study 
of this class and has contributed a lot to its understanding. We give a characterization 
of VP - a class that is well known for its apparent lack of natural characterizations. In 
the Boolean setting graph connectivity problems on edge-labeled graphs that are similar 
to our ABPs with stacks have been shown to be complete for LOGCFL I14j . One 
motivation for adapting these results to the arithmetic circuit setting is the hope that 
one can apply techniques from the NAuxPDA setting to arithmetic circuits. We show 
that this is indeed applicable by presenting an adaption of a proof of Niedermeier and 
Rossmanith [9] to give a straightforward proof of the classical parallelization theorem 
for VP first proved by Valiant et al. [12J. 

Another motivation is that our modified branching programs in different settings give 
various very similar characterizations of different arithmetic circuit classes. This allows 
us to give a new perspective on problems like VP vs. VP WS , VP vs. VNP that are classical 
question from arithmetic circuit complexity. This is similar to the motivation that 
Kintali [B] has for studying similar graph connectivity problems in the Boolean setting. 

Finally, all modifications we make to ABPs are straightforward and natural. The basic 
question is the following: ABPs are in a certain sense a memoryless model of compu- 
tation. At each point of time during the computation we do not have any information 
about the history of the computation sofar apart from the state we are in. So what 
happens if we allow memory during the computation? Intuitively, the computational 
power should increase, and we will see that it indeed does (under standard complexity 
assumptions of course). How do different types of memory compare? What is the role 
of the width of the branching programs if we allow memory? In the remainder we will 
answer several of these questions. 

The structure of the paper is a follows: After some preliminaries we start off with ABPs 
that may use a stack during their computation. We show that they characterize VP, 
consider several restrictions and give a proof of the parallelization theorem for VP. Next 
we consider ABPs with random access memory, show that they characterize VNP and 
consider some restrictions of them, too. 

2. Preliminaries 

2.1. Arithmetic circuits 

We briefly recall the relevant definitions from arithmetic circuit complexity. A more thor- 
ough introduction into arithmetic circuit classes can be found in the book by Biirgisser [S]. 
Newer insights into the nature of VP and especially VP,^ are presented in the excellent 
paper of Malod and Portier [8]. 

An arithmetic circuit over a field F is a labeled directed acyclic graph (DAG) consisting 
of vertices or gates with indegree or fanin or 2. The gates with fanin are called input 
gates and are labeled with constants from F or variables X\ , X% , . . . . The gates with 
fanin 2 are called computation gates and are labeled with x or +. 
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The polynomial computed by an arithmetic circuit is defined in the obvious way: An 
input gates computes the value of its label, a computation gate computes the product 
or the sum of its childrens' values, respectively. We assume that a circuit has only one 
sink which we call the output gate. We say that the polynomial computed by the circuit 
is the polynomial computed by the output gate. The size of an arithmetic circuit is the 
number of gates. The depth of a circuit is the length of the longest path from an input 
gate to the output gate in the circuit. 

We also consider circuits in which the +-gates may have unbounded fanin. We call 
these circuits semi-unbounded circuits. Observe that in semi- unbounded circuits x -gates 
still have fanin 2. A circuit is called multiplicatively disjoint if for each x-gate v the 
subcircuits that have the children of v as output-gates are disjoint. A circuit is called 
skew, if for all of its x -gates one of the children is an input gate. 

We call a sequence (f n ) of multivariate polynomials a family of polynomials or poly- 
nomial family. We say that a polynomial family is of polynomial degree, if there is a 
univariate polynomial p such that deg(/ n ) < p(n) for each n. VP is the class of polyno- 
mial families of polynomial degree computed by families of polynomial size arithmetic 
circuits. We will use the following well known characterizations of VP: 

Theorem 2.1. ( 112L fffi) Let (/„) be a family of polynomials. The following statements 
are equivalent: 

I- (fn) € VP 

■2- (fn) is computed by a family of multiplicatively disjoint polynomial size circuits. 

3- (f n ) is computed by a family of semi-unbounded circuits of logarithmic depth and 
polynomial size. 

VP e is defined analogously to VP with the circuits restricted to trees. By a classical 
result of Brent |3j, VP e equals the class of polynomial families computed by arithmetic 
circuits of depth 0(log(n)). VP^ is the class of families of polynomials computed by fam- 
ilies of skew circuits of polynomial size. Finally, a family (f n ) of polynomials is in VNP, 
if there is a family (g n ) G VP and a polynomial p such that f n (X) = Ylee{o i}j>M 9n(e, X) 
for all n where X denotes the vector (X\, . . . , X q r n \) for some polynomial q. 

A polynomial / is called a projection of g (symbol: / < g), if there are values € 
F U {Xi, X2, ■ ■ .} such that f(X) = g(a\, . . . , a q ). A family (f n ) of polynomials is a p- 
projection of (g n ) (symbol: (f n ) < p (g n )), if there is a polynomial r such that f n < g r ^ 
for all n. As usual we say that (g n ) is hard for an arithmetic circuit class C if for every 
(f n ) € C we have (f n ) < p (g n ). If further (g n ) E C we say that (g n ) is C-complete. 

The following criterion by Valiant [T3] for containment in VNP is often helpful: 

Lemma 2.2 (Valiant's criterion). Let (j) : {0, 1}* — > N be a function in ^P/poly, Then 
the family (f n ) of polynomials defined by 

n 

fn= E <X e )II^ ei 

ee{0,l} n i=l 

is in VNP. 
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2.2. Arithmetic branching programs 

The second common model of computation in arithmetic circuit complexity are arith- 
metic branching programs. 

Definition 2.3. An arithmetic branching program (ABP) G is a DAG with two vertices 
s and t and an edge labeling mj:E->FU {X\, X2, ■ ■ ■}■ A path P = V1V2 ■ ■ ■ v r in G has 
the weight w(P) := nl=i w ( v i v i+l)- Let v and u be two vertices in G, then we define 

fv,u = ^ui(P), 
p 

where the sum is over all u-ii-paths P. The ABP G computes the polynomial /q = f s j. 
The size of G is the number of vertices of G. 

Malod and Portier proved the following theorem: 

Theorem 2.4. ([8]) (f n ) G VP^ S , iff (f n ) is computed by a family of polynomial 
size ABPs. 

Definition 2.5. An ABP of width k is an ABP in which all vertices are organized into 
layers Li,i G N, there are only edges from layer Li to Li + \ and the number of vertices 
in each layer Li is at most k. 

The computational power of ABPs of constant width was settled by Ben-Or and Cleve: 

Theorem 2.6. (^) (f n ) G VP e , iff (f n ) is computed by a family of polynomial size 
ABPs of constant width. 

3. Stack branching programs 
3.1. Definition 

Let S be a set called symbol set. For a symbol s G S we define two stack operations: 
push(s) and pop(s). Additionally we define the stack operation nop without any argu- 
ments. A sequence of stack operations on S is a sequence op 1 op 2 ■ ■ ■ op r , where either 
opi = o~Pi(si) for 6pi G {push, pop] and s, G 5 or opi = nop. Realizable sequences of 
stack operations are defined inductively: 

• The empty sequence is realizable. 

• If P is a realizable sequence of stack operations, then push{s)P pop(s) is realizable 
for all s G S. Also nop P and P nop are realizable sequences. 

• If P and Q are realizable sequences of stack operations, then PQ is a realizable 
sequence. 
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Definition 3.1. A stack branching program (SBP) G is an ABP with an additional edge 
labeling a : E — >• {op(s) \ op £ {push, pop}, s G S} U {nop}. A path P = V\V2 ■ ■ ■ v T in 
G has the sequence of stack operations cr(P) := cr(viV2)o-(v2Vs) . . . a{v r -\v r ). If a(P) is 
realizable we call P a stack-realizable path. The SBP G computes the polynomial 

/ G = J>(P), 

p 

where the sum is over all stack-realizable s-i-paths P. 

It is helpful to interpret the stack operations as operations on a real stack that happen 
along a path through G. On an edge uv with the stack operation o~(uv) = push(s) we 
simply push s onto the stack. If uv has the stack operation a(uv) = pop(s) we pop the 
top symbol of the stack. If it is s we continue the path, but if it is different from s the 
path is not stack realizable and we abort it. nop stands for "no operation" and thus 
as this name suggests the stack is not changed on edges labelled with nop. Realizable 
paths are exactly the paths on which we can go from s to t in this way without aborting 
while starting and ending with an empty stack. 

To ease notation we sometimes call edges e with <r(e) = push(s) for an s € S simply 
piis/i-edges. pop-edges and nop-edges are defined in the obvious analogous way. 

It will sometimes be convenient to consider only SBPs that have no nop-edges. The 
following easy proposition shows that this is not a restriction. 

Proposition 3.2. Let G be an SBP of size s. There is an SBP G' of size 0(s 2 ) such 
that fc = fc' an d G' does not contain any nop-edges. If G is layered with width k, then 
G' is layered, too, and has width at most k 2 . 

Proof. The idea of the construction is to subdivide every edge of G. So let G be an 
SBP with vertex set V and edge set E. Let a and w be the stack symbol labeling and 
the weight function, respectively. G' will have the vertex set V U {v e \ e £ E}, stack 
symbol labeling a' and weight function w' . The construction goes as follows: For each 
edge e = uv S E the SBP G' has the edges uv e ,v e v. We set w'(uv e ) := w(uv) and 
w'(v e v) := 1. If e is a nop-edge we set a'(uv e ) := push(s) and a'(v e v) = pop(s) for an 
arbitrary stack symbol s. Otherwise, both uv e and v e v get the stack operation a(uv). 
It is easy to verify that G' has all desired properties. □ 

3.2. Characterizing VP 

In this section we show that stack branching programs of polynomial size characterize VP. 
Theorem 3.3. (/„) G VP, iff (f n ) is computed by a family of polynomial size SBPs. 

We the two direction of Theorem 13.31 independently. 
Lemma 3.4. If (/ n ) is computed by a family of polynomial size SBPs, then (f n ) £ VP. 
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Proof. Let (G n ) be a family of SBPs computing (F n ), of size at most p(n) for a polyno- 
mial p. Observe that deg(G n ) < p{n), so we only have to show that we can compute the 
G n by polynomial size circuits C n . 

Let G = G n be an SBP with m vertices, source s and sink t. The construction of 
C = C n uses the following basic observation: Every stack-realizable path P of length i 
between two vertices v and u can be uniquely decomposed in the following way. There 
are vertices a, b, c G V(G) and a symbol s G S such that there are edges va and be 
with a(va) = push(s) and o~(bc) = pop(s). Furthermore there are stack-realizable paths 
P a b from a to b and P cu from c to n such that length{P a i,) + length{P cu ) = i — 2 and 
P = vaP a }jbcP cu . The paths P a fe and P cu may be empty. We define w(u, v, i) := J2p w{P) 
where the sum is over all stack-realizable s-i-paths of length i. 

The values w(v,u, i) can be computed efficiently with a straightforward dynamic pro- 
gramming approach. First observe that w(v,u,i) = for odd i. For i = we set 
w(v, u, 0) = for v 7^ u and w(v, v, 0) = 1. For even i > we get 

w(v,u, i) = w{v, a)w(a, b,j)w(b, c)w(c, u, i — j — 2), 

where the sum is over all s € S 1 , all j < i — 2 and all a, 6, c such that o~{va) = push(s) 
and a {be) = pop{s). With this recursion formula we can compute alNote that Kintali 
proved a similar result for the Turing machine setting. 

1 w(v,u,i) with a polynomial number of arithmetic operations. Having computed all 
w{v,u,i) we get f G = E ie [ m ] w(s,t,i). □ 

The more involved direction of the proof of Theorem 13.31 will be the second direction. 
To prove it it will be convenient to slightly relax our model of computation. A relaxed 
SBP G is an SBP where the underlying directed graph is not necessarily acyclic. To 
make use of cyclicity we do not consider paths in a relaxed SBP G but walks, i.e. vertices 
and edges of G may be visited several times. Realizable walks are defined completely 
analogously to realizable paths. Also the weight w{P) of a walk is defined in the obvious 
way. Clearly, we cannot define the polynomial computed by a relaxed ABP by summing 
over the weight of all realizable walks, because there may be infinitely many of them 
and they may be arbitrarily long. Hence, we define for each pair u, w of vertices and for 
each integer m the polynomial 

fu,v,rn ■= 

P 

where the sum is over all stack-realizable u-v- walks P in G that have length m. Further- 
more, we say that for each m the relaxed SBP G computes the polynomial fem '■= fs,t,m- 
The connection to SBPs is given by the following straight-forward lemma. 

Lemma 3.5. Let G be a relaxed SBP and m € N. Then for each m there is an SBP 
G' m of size m\G\ that computes fc,m- 
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Proof. The idea is to unwind the computation of the relaxed SBP into m layers. Let 
G = (V, E, w, a), then for each v G V the SBP G' has m copies {v±, . . . , v m }. For each 
uv <G E the SBP G' had the edges UiVi + \ for i € [m — 1] with weight u;(ujVj+i) := w(uv) 
and stack operation cr^j'Uj+i) := o~(uv). This completes the construction of G'. 

Clearly, G' indeed computes fG,m and has size m\G\. □ 

To prove the characterization of VP we show the following rather technical proposition: 

Proposition 3.6. Let C be a multiplicatively disjoint arithmetic circuit. For each v € V 
we denote by C v the subcircuit of C with output v and we denote by f v the polynomial 
computed by C v . Then there is a relaxed SBP G = (V,E,w,a) of size at most 2|C|(|C| + 
1) + 3(|C|) such that for each v € V there is a pair v-, v + G V and an integer m v < 4\C V \ 
with 

* fv — fv— ,v+ ,m v j and 

• there is no stack-realizable walk from V- to v+ in G that is shorter than m v . 

Proof. We construct G iteratively along a topological order of C by adding new vertices 
and edges, starting from the empty relaxed SBP. 

Let first v be an input of C with label X. We add two new vertices v-,v+ to G 
and the edge V-V + with weigth w(v-v + ) = X and stack-operation o{v-v + ) := nop. 
Furthermore, m v := 1. Clearly, none of the polynomials computed before change and 
the size of the relaxed SBP grows only by 2. Thus all statements of the proposition of 
fulfilled. 

Let now v be an addition gate with children u, w. By induction G contains vertices 
u-,u + ,w_,w + and there are m u ,m v such that f u _, u+ ,m u = fu and f w _ iW+tJnw = f w . 
Assume w.l.o.g. m u > m w . We add two new vertices v-,v+ to G. Furthermore, we 
add a directed path of length m u — m w with start vertex v s and end vertex vt to G. 
We add the edges v-v s , vtW-, u + v + and w + v + . All edges we add get weight 1. 

Furthermore, we set a(v^u^) := push(vu), o~(u+v+) := pop(vu), a(v-V s ) := push(vw) 
and a(w + v + ) := pop{vw) for new stack symbols vu and vw. All other edges we added 
are nop-edges. Finally, set m v := m u + 2. 

Let us first check that G computes the correct polynomials. First observe that the 
edges we added do not allow any new walks between old vertices, so we still compute all 
old polynomials by induction. Thus we only have to consider the realizable v_-ti + -walks 
of length m v . Each of these either starts with the edge i;_u_ or the edge v-v s . In the 
first case, because of the stack symbols the walk must end with the edge Thus the 

realizable t>_i> + -walks of length m v that start with V-U- contribute exactly the same 
weight as the realizable u_-n + -walks of length m u which is exactly /„ by induction. 
Moreover, every t>_i> + -waiks of length m v that start with v-v s first makes m u — m w 
unweighted steps to w- and ends with the edge w + v + . Thus, these walks contribute 
exactly the same as the stackrealizable W--w+ walks of length m v — 2 — (m u — m w ) = m w , 
so they contribute /„,. Combining all walks we get f v _ ;V+ , mv = f u + f w = f v as desired. 

We have m v = m u + 2 < 4\C U \ + 2 < 4|C„| where the first inequality is by induction 
and the second inequality follows from the fact that v is not contained in C u and thus 
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|Ci>| > \C U \. To see the bound on |G| let s be the size of G before adding the new edges 
and vertices. By induction s < 2(\C V \ — 1)(\C V \ — 1 + 1) + 3(|C„| — 1). We have added 
2 + m u — tti v + 1 vertices and thus G has now size s + 3 + m u — m v < s + 3 + m u . 
But we have m u < 4|C U | < 4|C„| and thus the number of vertices in G is at most 
2(|C B | - 1)|C„| + 3(\C V \ - 1) + 3 + 4\C V \ < 2|C|(|C| + 1) + 3(|C„|). This completes the 
case that v is an addition gate. 

Let now v be a multiplication gate with children u, w. As before, G already contains 
U-, u + ,w-,w + and there are m u , m v with the desired properties. We add three vertices 
V-, v + and Vi and the edges w_-u_, u + Vi, ViW- and w + v + all with weight 1. The new 
edges have the stack symbols a(v-u ) := push(vu), a{u + Vi) := pop(vu), a(viW~) := 
push(vw) and a(w+v+) := pop(vw) for new stack symbols vu and vw. Finally, set 
m v := m u + m w + 4. 

Clearly, no stack-realizable walk between any pair of old vertices can traverse u_, 
v + or Vi and thus these walks still compute the same polynomials as before. Thus 
we only have to analyse the t>_-v + -walks of length m v in G. Let P be such a walk. 
Because of the stack symbols vu and vw the walk P must have the structure P = 
v-U-PiUj r ViW-P2Wj r Vj r where Pi and P2 are a stack-realizable u_-n+-walk and a stack- 
realizable u>_-u> + -walk, respectively. The walk P is of length m v and thus Pi and P2 
must have the combined length m u -\-m w . But by induction Pi must at least have length 
m u and P2 must have at least length m w , so it follows that Pi has length exactly m u 
and P2 has length exactly m w . The walks Pi and P2 are independent and thus we have 
fv—,vj r ,m v — fu—,u+,m u fw—,w-\-,m w — fufw desired. 

The circuit C is multiplicatively disjoint and thus we have \C V \ = \C U \ + \C W \ + 1. It 
follows that m v = m u + m w + 4 < 4\C U \ + 4\C W | + 4 = 4|C„ | where we get the inequality 
by induction. The relaxed SBP grows only by 3 vertices which gives the bound on the 
size of G. This completes the proof for the case that v is an addition gate and hence the 
proof of the lemma. □ 

Now the second direction of Theorem 13.31 is straight-forward. 

Lemma 3.7. Every family (/ n ) € VP can be computed by a family of SBPs of polynomial 
size. 

Proof. Given a family (C n ) of multiplicatively disjoint arithmetic circuits of polynomial 
size, first turn them into relaxed SBPs of polynomial size and polynomial m with Propo- 
sition 13.61 and then turn those relaxed SBPs into SBPs with Lemma 13.51 It is easy to 
check that the resulting SBPs have polynomial size. □ 

3.3. Stack branching programs with one stack symbol 

It is easy to see, that the number of symbols used in SBPs can be lowered to 2 without 
loss of computational power and with only logarithmic overhead in the size (see also 
Section 13.41 Therefore the only meaningful restriction of the size of the symbol set 
is the restriction to a set only consisting of one single symbol. The following fairly 
straightforward lemma shows that doing so indeed decreases the computational power. 
Note that Kintali proved a similar result for the Turing machine setting. 
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Lemma 3.8. (/„) € VP W s if an d only if it can be computed by polynomial size SBPs 
with one stack symbol. 

Proof. The direction from left to right is easy: Simply interpret each edge e of an ABP 
G as a nop-edge. 

For the other direction the key insight is that if one has only one stack symbol one 
only has to keep track of the size of the stack at any point in the path. But this height 
can be encoded by vertices of an ABP. So let G be a SBP of size m. It is clear that the 
stack cannot be higher than m on any path through G. We construct an ABP G' that 
has for every vertex v in G the m + 1 vertices Vo, v\, . . . v m . If vu is a push-edge in G, 
we connect Uj to Ui + \ for i = 0, . . . , m — 1 in G' . If vu is a pop-edge in G, we add ViUi-i 
for i = 1, . . . , m to G' . All these edges get the same weight as vu in the G. It is easy to 
see that every stack-realizable path P in the SBP G corresponds directly to a path P' 
in the ABP G' and P and P' have the same weight. Thus G and G' compute the same 
polynomial. Moreover, \G'\ = (m + 1)|G| which completes the proof. □ 

3.4. Width reduction 

In this section we show that unlike for ordinary ABPs bounding the width of SBPs 
does not decrease the computational power: Polynomial size SBPs with at least 2 stack 
symbols and width 2 can still compute every family in VP. 

Lemma 3.9. Every family (/ n ) G VP can be computed by a SBP of width 2 with the 
stack symbol set {0, 1}. 

Proof. The idea of the proof is to start from the characterization of VP by SBPs from 
Theorem l3.31 We use the stack to remember which edge will be used next on a realizable 
path through the branching program. We will show how this can be done with width 2 
SBPs with a bigger stack symbol size. In a second step we will seee how to reduce the 
stack symbol set to {0, 1}. 

So let (G n ) be a family of SBPs. Fix n and let G := G n with vertex set V and edge 
set E. Furthermore, let w be the weight function, a the stack operation labeling and S 
the stack symbol of G. Let s and t be the source and the sink of the SBP G. We assume 
without loss of generality that s has one single outgoing edge e s . Furthermore t is only 
entered by one nop-edge e± with weight 1. We will construct a new SBP G' with weight 
function w' and stack operation labeling a'. G' will have stack symbol set S U E. For 
each edge e with a successor edge e' the SBPG" contains a gadget G e>e i. The vertex set 
of G e>e > is {vl e ,,v^ e ,,vl e ,,v^ el ,vl el ,v® el }. These vertices are connected to a DAG by 
the edges {v\ e ,v 2 ee ,, v\ fil v\ e „ v\ e ,v\ e ,, v\ y v 5 e y,vl e ,v 6 e e ,,v 5 e e ,v 6 e e ,}. All these edges have 
weight 1 except for v 2 ee ,v\ e , for which we set w' (v 2 e ,vf e ,) := w(e). We call v 2 ee ,v\ e i the 
weighted edge of G e y. Furthermore we set o~{v\ e ,v 2 J) := pop(e), cr(v 2 e ,v\ J) := er(e), 
°~( v i e /V e e') := P u sh(e'). All other edges are nop-edges. The construction of G e ^ e > is 
illustrated in Figure [TJ 

Now choose an order <e of E such that for each pair uv, vw £ E, the edge uv comes 
before vw. This order can be iteratively constructed from a topological order <y of V: 
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For each vertex v along <y iteratively add the edges entering v to <e as the new 
maximum. From <e we construct an order <g of the gadgets G e ^ by defining 

G ei ,e 2 <G Ge 3 ,e4, ^> ei < e 3 V (ei = e 3 A e 2 < e 4 ). 

We now connect the gadgets along the order <g in the following way: Let G ei>e2 and 
G e3i e 4 be two successors in <q. We connect v\ i e2 to Ve 3 ,e 4 by a nop-edge of weight 1. 
Let G efi i be the minimum of <q. We add a new vertex s and the edge sv^ e , with weigth 
1 and stack opeation a{sv^ ,) := push(e s ) where e s is the single outgoing edge of s in 
G. Let now G efi i be the maximum gadget in <q. We add a new vertex t and the edge 
v e>e 't with weight 1 and stack operation pop(et). This concludes the construction of G' . 

It is easy to see that G' has indeed width 2. Thus we only need to show that G and 
G' compute the same polynomial. This will follow directly from the following claim: 

Claim 3.10. There is a bijection tt between the stack-realizable paths in G and G' . 
Furthermore w(P) := w'(it(P)) for each stack-realizable path in G. 

Proof. Clearly every s-i-path must traverse all gadgets in G'. Furthermore, whenever a 
gadget is entered, the stack contains only one symbol from E which lies at the top of 
the stack. Through each gadget G e ^i there are exactly the two paths v\ e ,v 2 e e ,v^, e iv\ e , 

and ^ e'^ee'^ee'^e e'- We can * ne f° rmer the weighted path through G e>e i. For a stack- 
realizable s-i-path P = e±e2 ■ ■ ■ e& through G we define tt(P) to be the unique path 
through G' that takes the weighted path through exactly the gadgets G ei)£i+1 for i = 
1, . . . , k = 1. We have w(P) := w'(tt(P)) with this definition, because only the weighted 
edges in the gadgets have a weight different from 1 in G' . So it suffices to show that tt 
is indeed a bijection. 

We first show that n maps stack-realizable paths in G to stack-realizable paths in G'. 
So let P be as before. Observe that n(P) traverses the gadgets G e ^ ei+1 in the same 
order as P traverses the edges ej. Furthermore, whenever tt(P) enters a gadget G euei+1 
the top stack symbol is and the rest of the stack content is exactly that on P before 
traversing e^. When leaving G euei+1 the stack content is that after traversing on P 
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with an additional symbol on the top. Thus all stack operations along vr(P) must 
be legal and the stack is empty after traversing the last edge towards t. Thus vr(P) is 
indeed stack-realizable. 

Clearly, n is injective, so to complete the proof of the claim we only need to show that 
it is surjective. So let P' be a stack-realizable s-t-path in G' . Let G ei e > , ■ ■ ■ , G e . e > be 

111 ™ ! h 

the gadgets in which P' takes the weighted path in the order in which they are visited. 
We claim that e s e\ ... e/; is a stack-realizable s-t-path. Clearly, s is the first vertex of 
P. Also in P' the symbol e± is popped in the last step by construction of G' , so the 
last gadget in which P' took a weighted path must be one of the form G e et , because 
otherwise ej cannot be the top symbol on the stack before the last step. Thus t is the 
last vertex of P. 

To see that P is a path, observe that we have e! i = ej+j. Otherwise P cannot have 
the right top symbol when taking the weighted path in G e . +1>e i. i - Thus e^+i must be a 
successor of e, in G and P is an s-i-path. 

To see that P is stack-realizable observe that when P' traverses the weighted edge 
of a gadget G e „/ it has the same stack content as when P traverses ej in G. So P is 
obviously stack-realizable because P is. 

Observing that obviously w(P) = P' by construction completes the proof. □ 

In a final step we now reduce the stack symbol size to {0, 1} in a straightforward way. 
Let i := [log (IS 1 U E\)~\, then each stack symbol s can be encoded into a {0, l}-string 
fi(s) of length I. Now we substitute each edge e of G' by a path P e of length i. If 
o"'(e) = push(s) we the edges along P e are pus/i-edges, too, that push //(e) onto the 
stack. If cr'(e) = pop(s) we pop /x(s) in reverse order along P e . If e is a nop-edge, all 
edges of P e are nop-edges, too. Finally, we give one of the edges in P e the weight w'(e), 
while all other edges get weight 1. Doing this for all edges, it is easy to see that the 
resulting SBP computes the same polynomial as G'. Furthermore, its width is 2. □ 

3.5. Depth reduction 

In this section we show that the characerization of VP by SBPs allows us to directly 
use results from counting complexity that rely on NAuxPDAs. We demonstrate this by 
adapting a proof by Niedermeier and Rossmanith [S] to reprove the classical paralleliza- 
tion theorem for VP originally proved by Valiant et al. [12]. While neither the result 
nor the proof technique is new in itself, we argue that the use of applying the techniques 
using SBPs results in a proof that is arguably more transparent than any other proof of 
this classical theorem that we know. This raises our hopes that the SBP characterization 
of VP may be helpful in the future. 

We now start presenting the ideas of Niedermeier and Rossmanith in detail. The 
basic idea is the following: The realizable paths are recursively cut into subpaths and 
the polynomials are then computed by combining the polynomials of the subpaths. In 
order to reach logarithmic depth we have to make sure that the paths are cut in paths 
of approximately equal length to result in a balanced computation. This is complicated 
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by that fact that the paths have to be realizable, so we have to account for the content 
of the stack during the computation. 

We now give the details of the construction. Let G be an SBP and let P be a realizable 
path in G from a to b. Let c be a vertex on P, then the stack height of P in c is the 
number of push-edges minus the number of pop-edges on P from a up to c. Similarly to 
Niedermeier and Rossmanith we give to a path P a description (a,b,i), where a is the 
start vertex, b the end vertex and i the length of P. 

We define a relation h on paths in order to decompose them. Let P be a path with 
realizable subpaths Pi and P2 and let these three paths have the descriptions (a,b,i), 
(c, d,j) and (e, b, k). Then we write Pi, P2 h P iff 

• the stack height of P on e is 

• there is an s 6 S such that a{ac) = push(s) and a(de) = pop(s) and 

• j + k = i-2. 

We state a Lemma by Niedermeier and Rossmanith: 

Lemma 3.11. Let P be a path with description (a, b, i) and i > 2. Then there ex- 
ist uniquely described subpaths P\, P2 and P3 with descriptions (c,d,i\), (e, /, 12) and 
(g,d,is) with 12^3 < i/2 < ii suc/i t/iat P2,P3 H Pi. 

Lemma 13.111 allows us to cut a path P into three parts P2 , P3 and P — P%. None 
of these parts is too big, but we cannot iterate this procedure, because unfortunately 
P — Pi is not a path because it has a "gap" from c to d. To remedy the situation 
Niedermeier and Rossmanith formalize this notion of a path with gap in the following 
way: A path with gap with description (a,(c,d,j),b,i) consists of two paths, one from 
a to c and one from d to b, where i and j with i > j are even natural numbers. P is 
realizable, if identifying c and d results in a realizable path of length i — j. Observe that 
P with description (a, (c, d, i) , b, i) is realizable if and only if a = c and b = d, i.e. the 
path consists only of a gap. 

We now extend the relation h to paths with gaps. This is complicated a little by the 
fact that the gap can lie in either of the two subpaths that we want to split a path with 
gap into. So let P be a path with gap and description (a, (c, d, j), b, i). For the first case 
let Pi be a subpath with gap and description (e, (c, d, j), /, k) and P2 be a subpath with 
description (g,b,l). For the second case let Pi be a subpath with description (e,f,k) 
and P2 a subpath with gap and description (g, (c,d,j),b,l). Then we write P\,P2 H P 
if and only if the stack height g is 0, there is an s E S such that o~(ac) = push(s) and 
cr(cte) = pop(s) and k + l = i — 2. Observe that if c = d and j = this definition coincides 
with the definition on paths without gap. 

Niedermeier and Rossmanith give a version of Lemma 13.111 for paths with gap. 

Lemma 3.12. Let (a,(c,d,j),b,i) with i — j > 2 be a realizable path with gap. Then 
there exist uniquely determined paths Pi, P2 and P3 such that Pi has the description 
(e,(c,d,j),f,ii), P2,P3 I- Pi and eitfier 
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1. P 2 has the description (g,(c,d,j),h,i2) and P3 has the description (k,f,is) such 
that ii — j < (i — j)/2 <i\— j or 

2. P2 has the description (g,h,i2) and P3 has the description (k,(c,d,j),f,is) such 
that i 3 - j < (i - j)/2 < i\ - j. 

Let P be a realizable path with gap with description (a, (c,d,j),b,i). Then we define 
its weight w(P) := w(P') where P' is the realizable path we get from P when we 
identify c and d. Let w(a,b,i) := Ylp w (P) where the sum is over all realizable paths 
with description (a,b,i). Furthermore, w(a,(c,d,j),b,i) := ^2 P w(P) where the sum is 
over all realizable paths with gap with description (a, (c, d,j),b, i). With these definitions 
and the Lemmas 13.111 and 13.121 we get the following Lemma: 

Lemma 3.13. a) 

w(a,b,i) = ^2w(a, (c,d,j),b,i)w(e,f,ii)w(g,d,i 2 )w(ce)w(fg) 

where the sum is over all c,d,e,f,g 6 V(G) such that there is an s with a(ce) = 
push(s) and cr(fg) = pop(s) and all even numbers j,ii,i2 with 11,12 < i/2 < j and 
h+i2=j~ 2. 

b) 

w(a, (c, d,j),b,i) 

= w ( a > ( Cl ' dl ^i), b, i)w(e, (c, d,j),f, i x )w(g, d 1 ,i 2 )w(c 1 e)w(fg) 
+ ^w{a, (c ll d 1 ,j 1 ),b,i)w(e,f,h)w{g, (c, d,j),d 1 ,i 2 )w(c 1 e)w{fg) 

where both sums are over all ci,di,e, f, g G V(G) such that there is an s with 
(j(cie) = push(s) and cr(fg) = pop(s). The first sum is also over all even numbers 
ji ,11,12 with i\—j < (i — j)/2 < j% — j and i\ + i 2 = ji — 2, while the second sum is 
over all even numbers j±,ii,i 2 with i 2 — j < {i — j)/2 < j\ — j and i\ + i 2 = ji — 2. 

Proof. (Sketch) For a) oberve that the decomposition of Lemma 13.111 is unique. So we 
sum the weight of every path from a to b of length i exactly once. For b) use Lemma 
13.121 for the same argument. □ 

The following Lemma is now easy to see: 

Lemma 3.14. Let G be an SBP. Then fc can be computed by a semi-unbounded circuit 
of depth 0(log(|G|)) and size 

Combined with Theorem 13.31 we get the parallelization Lemma by Valiant et al. [12J. 

Corollary 3.15. Let (f n ) G VP. Then (f n ) can be computed by a family of semi- 
unbounded circuits of polynomial size and logarithmic depth in n. 
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4. Random access memory 



4.1. Definition 

We change the model of computation by allowing random access memory instead of a 
stack. We still work over a symbol set S like for SBPs but we introduce three random 
access memory operations: The operation write and delete take an argument s £ S while 
the operation nop again takes no argument. Let op{s) be a random access memory oper- 
ation with op G {write, delete] and P = opi op 2 ■ ■ ■ op r a sequence of memory operations. 
With occ(P, op(s)) we denote the number of occurences of op(s) in P. We call a sequence 
P realizable if for all symbols s € S we have that occ(P, write(s)) = occ(P, delete(s)) 
and for all prefixes P' of P we have occ(P' , write(s)) > occ{P' , deleters)) for all s E S. 

Intuitively the random access memory operations do the following: write(s) writes 
the symbol s into the random access memory. If s is already there it adds it another 
time, delete(s) deletes one occurence of the symbol s from the memory if there is one. 
Otherwise an error occurs, nop is the "no operation" operation again like for SBPs. A 
sequence of operations is realizable if no error occurs during the deletions and starting 
from empty memory the memory is empty again after the sequence of operations. 

Definition 4.1. A random access branching program (RABP) G is an ABP with an 
additional edge labeling a : E — > {op(s) \ op € {write, delete}, s S S} U {nop}. A path 
P = v\V2---v r in G has the sequence of random access memory operations <r(P) := 
o-{v\V2)o~{v2Vz) . . . o~(v r -iv r ). If o~(P) is realizable we call P a random-access-realizable 
path. The RABP G computes the polynomial 

f G = j2w(p), 
p 

where the sum is over all random-access-realizable s-t-paths P. 

In a completely analogous way to Proposition 13.21 we can proof that disallowing nop- 
edges does not change the computational power of RABPs. 

Proposition 4.2. Let G be an RABP of size s. There is an SBP G' of size 0(s 2 ) such 
that fc = fc 1 an d G' does not contain any nop-edges. If G is layered with width k, then 
G' is layered, too, and has width at most k 2 . 

4.2. Characterizing VNP 

Intuitively random access on the memory allows us more fine-grained control over the 
paths in the branching program that contribute to the computation. While in SBPs 
nearly all of the memory content is hidden, in RABPs we have access to the complete 
memory at all times. This makes RABPs more expressive than SBPs which is formalized 
in the following theorem. 

Theorem 4.3. (f n ) S VNP if and only if there is a family of polynomial size RABPs 
computing (f n )- 
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Again we prove the theorem in two independent lemmas, starting with the upper 
bound which is very easy. 

Lemma 4.4. If (/„) is computed by a family of polynomial size RABPs, then (f n ) G 
VNP. 

Proof. This is easy to see with Valiant's criterion (Lemma l2.2p and the fact that checking 
if a path through a RABP is realizable is certainly in P. □ 

We will now show the lower bound of Theorem 14.31 We will prove it directly for 
bounded width RABPs. To do so we consider the following dominating-set polynomial 
for a graph G = (V, E): 

DSP G (X 1 ,...,X n ) :=J2 Yl*v, 

D v<ED 

where the sum is over all dominating sets D in G. 

In Appendix [A] we show that the is a family (G n ) of graphs such that the resulting 
family (DSPc n ) of polynomials is VNP-complete. 

Lemma 4.5. For each family (f n ) S VNP there is a family of width 2 RABPs of poly- 
nomial size computing (f n )- 

Proof. We will show that for a graph G = (V, E) with n vertices there is a RABP of size 
and width 2 that computes DSPg(Xi, . . . ,X n ). The RABP works in two stages. 
The symbol set of the RABP will be V . In a first stage it iteratively selects vertices 
v and writes v and all of its neightbors into the memory. In a second stage it checks 
that each vertex v was written at least once into the memory, i.e., either v or one of its 
neighbors was chosen in the first phase. Thus the set of chosen vertices must have been 
a dominating set. 

So fix a graph G. For each vertex v with neighbors vi, . . . ,Vk we construct a gadget 
G v as shown in Figure [2j We call the path through G v with the edges that have memory 
operations the choosing path. Now for each vertex v we construct a second gadget G' v 
that is shown in Figure O Choose an order on the vertices. For each non-maximal vertex 
v in the order with successor n, we connect the sink of G v to the source of G u and the 
sink of G' v to the source of G' u with a nop-edge of weight 1. Finally, let x be the maximal 
vertex in the order and y the minimal vertex. Connect the sink of G x to the source of 
G' y again by a nop-edge of weight 1. 

We claim that G' computes DSPq- To see this, define the weight of a vertex set D in 
G to be w(S) := Y\ ve sX v . The following claim completes the proof. 

Claim 4.6. There is a bijection tt between dominating sets in G and RA-realizing paths 
in G' such that for each dominating set D in G we have w(D) := w(tt(D)). 

Proof. Observe that for RA-realizing paths through G' once the path through the gadgets 
G v is chosen, then rest of the path is fixed. So each RA-realizing path P can be described 
completely by the v for which the choosing paths through G v is taken. 
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Figure 2: The gadget G v . Let v be a vertex with neighbors v±, . . . ,Vd- The weight of 
XyXy IS X v while all other edges have weight 1. G v has two paths. Ev- 
ery realizable path that traverses G v on the upper path writes v and all of 
its neightbors into the memory. This path has weight X v . Realizable paths 
through the upper path do not change the memory in G v and have a weight 
weight contribution of 1 in G v . 




Figure 3: The gadget G' v . Let d be the degree of v, then G' v has d + 3 layers. All edges 
have weight 1. The edges connecting vertices in the lower level have operation 
delete(v) while all other edges have no memory operation. Every realizable 
path through G' v has weight 1 and deletes between 1 and d+1 occurences of 
the symbol v from memory. 



Let D be a dominating set. Let V be the set of s-t-paths in G' that for each v G D 
take the choosing path through G v and for each G' take the other path. Because D is 
dominating, after a path P G V has passed through the gadgets G v , it contains each 
symbol v G V at least once. Thus there is a unique path in V that is RA-realizing. Call 
this path tt(D). 

Obviously, ir is injective. To show that it is surjective, too, consider an RA-realizable 
path P in G'. Let D be the set of V G V for which P takes the choosing path. The 
path P passes every gadget G' v , so each element v G V gets deleted from the memory at 
least once. It follows that each v £ V must have been written to memory at least once 
before. So for v G V the path P must go through G v or through G u for a neighbor u of 
v. It follows that D is a dominating set. Furthermore, tt(D) = P, so ir is surjective. 

Finally, w(D) := w(ir(D)) is true, because the only weighted edges in G' are in the 
gadgets G v and for each v the weighted edge in G v has the weight X v . □ 
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Observing that G' has width 2, completes the proof. 



□ 
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A. VNP-completeness of the dominating-set polynomial 

In this appendix we show that there is a family of graphs such that the the polynomial 
family (DSPc n '■= YId YlveD ^v) is VNP-complete. With Valiant's criterion (Lemma 
I2.2p containment in VNP is clear. 

For hardness we will reduce from the polynomial VCPq = Y^sTlveS -^v where the 
sum is over all vertex covers S of G. This polynomial was introduced by Briquel and 
Koiran [3] who showed the following hardness result: 

Lemma A.l ([I]). There exists a family G n of polynomial size graphs such that (VCPg„) 
is V 'NP '-complete. 

With Lemma I A. II it suffices to show that for every graph G there is a graph G' of size 
polynomial in the size of G such that 

VCP G < DSP G i. 

So let G = (V, E) be a graph. We construct G' by adding for each e = uv € e a vertex 
v e and the edges uv e and vv e . Every dominating set D of G' must contain v e or one of 
u, v. Thus D is either a vertex cover of G or it contains a vertex v e for an e £ E. Setting 
X Ve := one gets VCPq as the projection of DSPq 1 ■ This finishes the proof. 
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